Speculative Multiprocessor Cache Line Actions Using Instruction History
نویسنده
چکیده
A technique is described for reducing miss latency in coherent-cache sharedmemory parallel computers. Miss latency is reduced by speculatively invalidating and downgrading (copying an exclusively held line back to memory) cache lines at one processor that might be needed at another processor. A line becomes a candidate for speculative invalidation when another line last accessed by the same instruction is invalidated. A line becomes a candidate for speculative downgrading under corresponding conditions. The technique can be implemented by constructing linked lists of lines for recent memory access instructions. The amount of memory needed by an implementation is little more than 11% the size of the cache. No time need be added to cache hits. In execution-driven simulations of such systems running programs from the SPLASH 2 suite invalidations and downgrades are reduced by 50% or more.
منابع مشابه
Speculative Multiprocessor Cache Line Actions Using Instruction and Line History
A technique is described for reducing miss latency in coherent-cache shared-memory parallel computers. Miss latency is reduced by speculatively invalidating and updating (copying an exclusively held line back to memory) cache lines at one processor that might be needed at another processor. A line becomes a candidate for speculative invalidation when another line last accessed by the same instr...
متن کاملSpeculative Multiprocessor Cache Line Actions Using Instruction and Line History 1
A technique is described for reducing miss latency in coherent-cache shared-memory parallel computers. Miss latency is reduced by speculatively invalidating and updating (copying an exclusively held line back to memory) cache lines at one processor that might be needed at another processor. A line becomes a candidate for speculative invalidation when another line last accessed by the same instr...
متن کاملFast Checkpoint/Recovery to Support Kilo-Instruction Speculation and Hardware Fault Tolerance
The increased relative cost of accessing memory is encouraging processor designers to explore deeper uniprocessor speculation (e.g., with branch and value prediction) and consider multiprocessor speculation (e.g., on coherence message types and values). While some mechanisms have been proposed to support deep speculation using speculative multithreading, current mechanisms for conventional proc...
متن کاملSecond - level Instruction Cache Thread Processing Unit Thread Processing Unit Thread Processing Unit Instruction Cache First - level First - level First - level Instruction Cache Instruction Cache Execution
This paper presents a new parallelization model, called coarse-grained thread pipelining, for exploiting speculative coarse-grained parallelism from general-purpose application programs in shared-memory multiprocessor systems. This parallelization model, which is based on the ne-grained thread pipelining model proposed for the superthreaded architecture 11, 12], allows concurrent execution of l...
متن کاملTraveling Speculations: An Integrated Prediction Strategy for Wide-Issue Microprocessors
Performing multiple, accurate, low-latency predictions is crucial to improving instruction throughput in future wide-issue microprocessors. However, demands of wide-issue processing coupled with implementation challenges posed by high clock frequencies present obstacles to these prediction goals. This paper proposes the Traveling Speculation framework to accommodate predictions in a wide-issue ...
متن کامل